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METHOD AND CONSTRUCT FOR SCREENING FOR INHIBITORS OF 

TRANSCRIPTIONAL ACTIVATION 

Fjdd of the Invention 

5 This invention relates to methods for screening and identifying inhibitors of 

promoters or inhibitors of transcriptional activation by use of a collision conanict. 
The collision construct provides for increased expression of reporter gene signal in the 
presence of an appropriate inhibitor. This invention, thus, relates to the collision 
construct as well as methods for malting and producing the collision construct, and to 
10 vectors, host cells and kits containing the collision construct 

Background of the Invention 
Past studies of inhibitors of gene function include studies of the inhibition of 
transcription of the gene, as described in Hsu et al, Science 254: 1799-1802 (1991) and 

15 Hsu et ol t Prvc. NatH Acad. ScL USA 90: 6395-6399 (1993). Structural genes have a 
transcription regulatory region that contains one or more sequences, referred to herein 
as response elements, that are capable of binding to certain proteins, referred to herein 
as binding proteins, that activate or repress transcription or facilitate elongation of a 
rnRNA transcript. These binding proteins include transcription factors, such as those 

20 described in Faisst & Meyer (1992), Nudeic Acids Res. 20: 3-26; THE 

ENCYCLOPEDIA OF MOLECULAR BIOLOGY, J. Kendrew, ed. (Blackwdl 
Science, Oxford 1994); and those that are identified in specialized data base, as 
described in Ghosh (1993), Nucleic Acids Res. 21: 3117-3118. 

A transcription factor, such as an activator, typically contains a domain that 

25 recognizes a specific DNA sequence, the response element, and binds it Transcription 
activators may also contain another domain that interacts with other transcription 
factors to initiate transcription or to allow elongation of die RN A transcript Thus, a 
molecule that inhibits binding of an activator to a response dement, either by 
competitively binding to the DNA-btnding domain of the activator or to the response 

30 element, or a molecule that blocks the transcription factor-interacting domain of the 
activator, would be expected to inhibit transcriptional activation. 
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Repressors, also mostly proteins, function in a similar manner, except that 
instead of activating transcription, the repressor binds to the response dement, for 
example, an operator, and blocks transcription. Molecules that are capable of 
competitively binding to repressors activate transcription by removing the repressor 

5 from the response element. Thus, an inhibitor to tibc molecule that competitively binds 
to the repressor would be expected also to inhibit transcriptional activation. 

There exist other modes of molecular action which also function to inhibit 
transcription and which operate by methods distinct from classic repression or 
activation of gene transcription described above. Such methods include, for example, 

10 catalytic events directed against the mRNA of a transcription factor. It would be 
desirable to employ these methods for inhibiting transcription of a target promoter or 
transcription of a transcription factor and to devise a method to screen for such 
inhibitors. 



15 they look for a decrease in biological function or a decrease in a reporter signal that 
reflects inhibition of that function. Very often, a decrease in signal is difficult to 
interpret because the decrease may be the result of factors other than the presence of 
the supposed inhibitor being tested. For example, the decrease in signal may be caused 
by the presence of extraneous matter including toxic chemicals in the media, 

20 inappropriate incubation temperature, inappropriate incubation time, poor condition of 
the ceils used in the test, etc. In order to resolve the matter, a number of time- 
consuming experiments have to be run with a number of controls. 

It would be desirable, therefore, if the presence of an inhibitor can be reflected 
by an increase in reporter gene signal instead of a decrease. 



It is, therefore, an object of the present invention, to provide a screening test 
for inhibitors that is capable of generating an increase in reporter signal in the presence 
of an inhibitor. 



Conventionally, when researchers look for an inhibitor of a biological function. 
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In accordance thereto, there is provided herein a construct, termed a collision 
construct, that contains a nucleic acid molecule, comprising a first regulatory sequence 
that comprises a first promoter, a reporter gene that is under transcriptional control of 
the first promoter, where the reporter gene is capable of providing a detectable signal 
5 upon transcription and translation thereof, and a second regulatory sequence that 
comprises a second promoter, where the direction of transcription under the first 
promoter is opposite to the direction of transcription under the second promoter, where 
regulation of transcription under trie second promoter alters the reporter gene signal, 
and where the first promoter is different from the second promoter. 

10 In accordance to a further object of the present invention, there is provided 

herein the collision construct as above, where the second promoter or the second 
regulatory sequence comprises a first response element that is capable of binding to a 
first binding protein to form a first binding pair, and the formation of the first binding 
pair regulates the activity of the second promoter. 

15 In accordance with another object of the present invention, there is provided 

herein the collision construct as above, where the last nucleotide of the stop codon of 
the reporter gene is separated from the 3* terminus of the second promoter by a 
distance of about less than about 2050 nucleotides. 

In accordance to still another object of the present invention, there is provided 

20 herein the collision construct as above, where one or both of the first promoter and 
second promoter are derived from a promoter or promoter/enhancer region of a gene 
selected from the group consisting of: a viral gene, a bacteriophage gene, a prokaryotic 
gene, and an eukaryotic gene. The eukaryotic gene can be .a yeast or other fungal 
gene, an avian gene, an insect gene or a mammalian gene. Alternatively, the promoter 

25 or promoter/enhancer may be synthetically made, or partly derived and partly 
synthesized. 

In accordance to yet another object of the present invention, there is provided 
herein the collision construct as above, where one or both of the first response element 
and the second response element, the latter optionally present in die first regulatory 
30 region, are derived from the regulatory seqirnce of a gene selected from the group 
consisting of: a viral gene, a bacteriophage gene, a prokaryotic gene, and an 
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eukaryotk gene. The eukaryotic gene can be, for example, a yeast or other fungal 
gene, an insect gene, an avian gene, or a mammalian gene. 

In accordance to a further object of the present invention, there is provided 
herein a method of using the collision construct as above for screening or identifying a 
5 candidate inhibitor for its ability to inhibit transcription under a target promoter, the 
method comprising the steps of providing a cell that contains die collision construct, 
where the second promoter in the construct is the target promoter and the cell is 
capable of expressing the collision construct to produce a reporter gene signal, 
determining reporter gene signal in the absence and presence of the candidate inhibitor, 
10 respectively, and comparing reporter gene signals obtained. An appropriate inhibitor 
is one that is capable of generating an increased reporter signal in the presence of an 
inhibitor. 

In accordance to another object of the present invention, there is provided 
herein a method as above, where the second promoter or the second regulatory region 

15 in the collision construct comprises a response element that is capable of binding to a 
binding protein. The binding protein can be provided by coexpression in a cell of the 
collision construct and a vector that comprises a coding sequence for binding protein. 
Alternatively, the binding protein can be provided by a cell that produces it 
constitutively and the collision construct is then introduced into die cell. Also, die 

20 binding protein can be added directly to the cell that contains the collision construct. 
In accordance to another object of the present invention, there is provided 
herein a method of making the collision construct by providing and linking together a 
first regulatory sequence that comprises a first promoter, a reporter gene that is capable 
of providing a detectable signal upon transcription and translation, a second regulatory 

25 sequence that comprises a second promoter, where the reporter gene is placed under 
regulatory control of die first promoter, the direction of oanscriprioo under the first 
promoter is opposite the direction of transcription of the second promoter, and the first 
promoter is different from the second promoter. 

In accordance to a farther object of the present invention, there is provided 

30 herein a method of production of the collision construct by culturing a host cell that 
comprises the collision construct, for example, a prokaryooc or eukaryotic host cell, 
for example, a bacterial or yeast cell . 
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In accordance with yet another object of the present invention, there is provided 
herein a kit that comprises the collision construct as above, or a vector or host cdl 

* * 

containing the collision construct, with instructions for use thereof in accordant with 
the method described above. 
5 Further objects, features, and advantages of the present invention will become 

apparent from the following detailed description. It should be understood, however, 
that the detailed description, while indicating preferred embodiments of the invention, 
is given by way of illustration only, since various changes and modifications within the 
, spirit and scope of the invention will become apparent to those skilled in the art from 
10 this detailed description. 

> 
* 

a 

■ 

Brief Description of die Drawings 
FIG. I is a schematic representation of a collision construct, transformed in 
HeLa cells, containing the CMV promoter and the HIV-1 promoter running in opposite 

15 directions and another construct for control in which the HIV-1 promoter was absent. 
The graph depicts Tat-dependem inhibition of CMV promoter activity over a range of 
Tat levels, in micrograms of Tat expression vector, from about 0 to 2. The symbol (•) 
represents the alkaline phosphatase gene expression in the collision construct. The 
symbol (0) represents alkaline phosphatase gene expression in the control construct 

20 showing nonspecific reduction of CMV promoter activity by Tat protein. Other 

abbreviations include AP, representing alkaline phosphatase; and CMV, representing 
. the cytomegalovirus promoter/enhancer. 

FIG. 2 is a schematic representation of five different collision constructs, 
designated as #1152, #1161, #1225, #1162. and #1163, and the reporter gene signals 

25 in percent alkaline phosphatase activity, generated by expression of 1 ug DNA each, 
respectively, and determined in the presence of 0.5 ug of Tat protein expression 
plasmid (" +TaT) or 0.5 ug inactive Tat protein expression plasmid ("-Tat") in HeLa 
cells. FIG. 2 shows that specific reduction of alkaline phosphatase expression in die 
presence of HIV-1 Tat protein is dependent on a functional TAR sequence in the LTR. 

30 When die collision construct #1152 was used, in the presence of inactive Tat plasmid* 
reporter gene signal was high. This signal was suppressed in the presence of active Tat 
plasmid by about 6056. What a portion of the TAR sequence was deleted (construct 
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#1161), or when further portions of the promoter were deleted, there was no 
significant difference in reporter gene signal in the presence or absence of TaL Thus, 
suppression of alkaline phosphatase expression in the collision construct is Tat protein 
and TAR sequence dependent. 
5 FIG. 3 is a schematic representation of five different reporter constructs* 

designated as #1085, #1166, #1213, #1167. and #1168, and the reporter gene signals, 
indicated as percent alkaline phosphatase activity, generated by expression of 1 pg of 
each, respectively, in the presence of 0.5 pg Tat protein expression plasmid (" +Tat") 
or 0.5 pg inactive Tat protein expression plasmid ("-Tat") in HeLa cells. The alkaline 

10 phosphatase activity measured with construct #i085 in the presence of active Tax 
protein expression plasmid was set to 100%. FIG. 3 illustrates that HIV-1 promoter 
activation by Tat protein requires the presence of the TAR sequence and that TAR may 
possibly have a silencer function because when a portion of TAR was deleted, reporter 
gene signal in die absence of Tat increased as compared to that when the complete 

15 TAR sequence was present. Addition of Tat to cells containing this construct, #1 166, 
which does not allow the formation of a TAR stem loop structure, enhanced reporter 
gene signal by about 2.5-fold. Additional deletion of the promoter region to include 
the TATA box and Spl binding sites resulted in complete loss of reporter gene signal, 
in the absence or presence of Tat. 

20 FIG. 4 is a schematic representation of different collision constructs containing 

spacer regions of about 21, 94, 153, 406, 556 and 2047 nucleotides, positioned 
between the 3' end of the AP coding sequence, as defined by the stop codon TAA, or 
TGA in construct with a spacer of 21 nucleotides, and the end of the TAR sequence at 
nucleotide +59 in the HIV-1 promoter, with 4-1 nucleotide as the start of 

25 transcription. The sequence of this junction is shown in FIG. 5. 

FIG. 5 shows an argument map of the DNA sequence of the 3* end of the AP 
gene and the 5' end of the mutant HIV-1 promoter in the collision construct #1 152. 
The stop codon of the AP coding region is indicated. Stan of transcription in the HJV- 
1 promoter at position 263 is indicated with +1. Nucleotide substitution st position 

30 268 (substitution of T to Q, position 271 (substitution of C to A), and position 344 
(substitution of A to Q are shown. In addition, an insertion of a T nucleotide at 
position 301 is also shown. 
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FIG. 6 indicates that TAR decoys act as specific inhibitors of HiV-l 
transcription. FIG 6a shows inhibition of HIV-1 transcription in the presence of TAR 
decoys. At the top of FIG 6a is a schematic diagram of an HIV-AP reporter gene 
construct and a plasmid expressing rnultirnerized (8 copies) of transactivation response 
5 sequence, also called TAR decoys. The arrows indicate the direction of transcription 
and translation. The black boxes represent the TAR sequence. At the bottom of FIG. 
6a, a schematic indicates that HcLa cells were transfected with various combinations of 
HIV-AP (lug) reporter and TAR expression plasmids (0.5ug) in the presence or 
absence of 0.5ug of Tat expression vector. AP activity was determined as described 

10 previously and is expressed as fold activation relative to the level obtained with the 
HIV-AP plasmid in the absence of Tat (represented in lane 1). FIG 6b shows 
increased reporter gene expression in the collision construct by inhibition of HIV- 1 
promoter activity. At the top of FIG 6b is a schematic representation of th collision 
construct and a plasmid expressing multimerized copies of the TAR sequence. At the 

IS bottom of FIG. 6b, HcLa cells were transfected with various combinations of the 

collision construct (lug) and increasing amounts of a TAR expression plasmid (lanes 3 
to 5, lug and 2ug respectively) in the absence or presence of 0.5 ug Tat expression 
plasmid. The total DNA concentration in each experiment was kept constant by adding 
a Tat expression vector containing a premature stop codon and a pBJ plasmid 

20 expressing an unrelated fleptin) gem. Each column represents the mean of at least 
three independent experiments. Error bars represent standard error from multiple 
transfectkms. 

Detailed Description of the Preferred Embodiments 
25 The invention described herein draws on previously published work and, at 

times, on pending patent applications. By way of example, such work consists of 
scientific papers, abstracts, or issued patents, and published patent applications. All 
published work cited herein are hereby incorporated by reference. 

The inventors herein have discovered that a collision construct can be made that 
30 can be used for screening inhibitors of promoter or transcriptional activity. By use of 
this collision construct, the presence of a desired inhibitor is indicated by an 
enhancement in reporter gene signal. 
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For a better understanding of the present invention, the terms used herein have 
the following definition: 

A "nucleic acid molecule'' or "nucleic acid sequence" refers to either a DNA 
sequence, a RNA sequence, or complementary strands thereof that comprise a 
5 nucleotide sequence. 

A "regulatory sequence* refers to a nucleic acid sequence encoding one or more 
elements that are capable of affecting or effecting expression of a gene sequence, 
including transcription or translation thereof, when the gene sequence is placed in such 
a position as to subject it to the control thereof. Such a regulatory sequence can be, 

10 for example, a minimal promoter sequence, a complete promoter sequence, an 

enhancer sequence, an upstream activation sequence ("UAS"), an operator sequence, a 
downstream termination sequence, a polyadenylau'on sequence, an optimal 5* leader 
sequence to optimize initiation of translation, and a Shine-Dalgarno sequence. 
Alternatively, the regulatory sequence can contain a combination enhancer/promoter 

15 dement The regulatory sequence thai is appropriate for expression of the present 
construct differs depending upon the host system in which the construct is to be 
expressed. Selection of the appropriate regulatory sequences for use herein is within 
the capability of one skilled in the art. For example, in prokaryotes, such a regulatory 
sequence can include one or more of a promoter sequence, a ribosomal binding site, 

20 and a transcription termination sequence. In eukaryotes, for example, such a sequence 
can include one or more of a prompter sequence and/or a transcription termination 
sequence. If any necessary component of a regulatory sequence that is needed for 
expression is lacking in the collision construct, such a component can be supplied by a 
vector into which the collision construct can be inserted for transformation or 

25 reimroduction into a host celL Regulatory sequences suitable for use herein may be 
derived from any source including a prokaryotic source, an eukaryotic source, a virus, 
a viral vector, a bacteriophage or a linear or circular plasrnkL An example of a 
regulatory sequence is the human irnmunooeficiency virus ("HIV-l") promoter that is 
located in the U3 and R region of the HIV- 1 long terminal repeat (~L.TR"). 

30 Alternatively, me regulatory sequence herein can be a synthetic sequence, for example, 
one made by combining the UAS of one gene with the remainder of a requisite 
promoter from another gene, such as the GADP/ADH2 hybrid promoter. 
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A "minimal promoter" is a naturally occurring promoter that has been 
weakened so that it is not 100% active. For example, a promoter in which all but a 
TATA box has been deleted, such as the minimal fos promoter, as described in 
Berkowitz et al. (1989) Mol. Cell. Biol. 5:4272-4281. 
5 A "reporter gene" refers to a nucleic acid molecule that encodes a polypeptide 

that is capable of providing a detectable signal either on its own upon transcription or 
translation or by reaction with another one or more reagents. Reporter genes suitable 
for use herein are conventional in the art, selection of which is within the capability of 
one skilled in the an. Examples of such reporter genes include that encoding the 

10 enzyme chloramphenicol acetyltransf erase ("CAT"), the luc gene from the firefly that 
encodes luciferase, the bacterial tocZgenc from Escherichia coli that encodes 0- 
galactosidase, alkaline phosphatase ("AP"), human growth hormone ("hGH"), the 
bacterial ^-glucuronidase ("GUS"), and green fluorescent protein ("GFP") t as 
described in Ausubd et al.. CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 

15 (1994). (Greene Publishing Associates and John Wiley & Sons, New York, N.Y.). 

A "response element" refers to a region of a nucleic acid molecule, usually, 

* 

from a regulatory region of a gene, that is capable of specifically binding to a binding 
protein, such as an activator molecule, for activation of transcription or for allowing 
the elongation of a RNA transcript, or a repressor molecule, for inhibition of 

20 transcription. Some response elements are known in the art. Selection of a response 
element that is suitable for use herein is within the capability of one skilled in die art. 

A "binding protein" herein refers to a protein that is capable of specifically 
binding to a response dement for regulation of transcription. Some binding proteins 
are known. Selection of a binding protein suitable for use herein is also within the 

25 capability of one skilled in the art A number of DNA binding proteins as well as 
response dements of the transcription regulatory regions are described in Wtngender 
(1988), Nucleic Adds Res. 16: 1879-1902; Molecular Cdl Biology, J. Darndl. H. 
Lodish & D. Bahimore, (Scientific American Books. New York 1990); and Dhawate 
& Lane (1993), Nudeic Acids Res. 21:5537-5546. One example is the Tat/TAR 

30 combination found in viruses such as human immunodeficiency virus-1 ("HIV-1"), 
. human immunodeficiency virus-2 ("HIV-2"), and simian immunodeficiency vims 
("SIV"). In these viruses, mz/zraavator, "Tat", is the binding protein referred to 
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herein and /ra/tf -activating response dement, "TAR", is the response element referred 
to herein, as described in Jones & Petertin (1994), Ann. Rev. Biocheau 65:717-743; 
and Antoni tt of. (1994), Adv. Vims Res. 43:53-145. Examples of response elements 
and binding proteins besides TAR and Tat include Rev response dement ("RRE"). a 
5 NF-cB binding site, a Spl binding site, and GaM and LexA binding sites. Examples 
of binding proteins include Tat, Rev, NF-cB, Spl f Gal 4. and LexA. 

The term "binding pair" refers to a pair of molecules, including a DNA/DNA 
pair, DNA/RNA pair, protdn/DNA pair, proteui/RNA pair, and a protein/protein pair 
in which the components of the pair bind specifically to each other with a higher 

10 affinity than to a random molecule, such that upon binding, the pair triggers a 

biological response, such as activation of transcription or where the binding protein is a 
repressor, suppresses a biological response, that is, transcription. 

The term "specific binding* in reference to interaction between two molecules 
indicates a higher affinity binding and a lower dissociation constant than non-specific 

15 binding, thus, distinguishing specific binding from background binding. 

The term "regulates." in the context of transcription, denotes both positive and 
negative regulation. Positive regulation is exemplified by activation. Negative 
regulation is exemplified by repression. 

Although the methodology described below is believed to contain sufficient 

20 details to enable one skilled in the art to practice the present invention, other constructs 
not specifically exemplified, such as plasmids, can be constructed and purified using 
standard recombinant DNA techniques as described in, for example, Sambrook a aL 
(1989), MOLECULAR CLONING: A LABORATORY MANUAL, 2nd ed. (Cold 
Spring Harbor Press, Cold Spring Harbor, New York); and under current regulations 

25 described in United States Department of HEW, NATIONAL INSTITUTE OF 
HEALTH (NIH) GUIDELINES FOR RECOMBINANT DNA RESEARCH. 

In one embodiment of the present invention, therefore, the collision construct 
comprises a reporter gene coding sequence that is linked at its 5 V end to a first 
regulatory sequence that comprises a first promoter such that the reporter gene is 

30 placed under transcriptional regulatory control of the first promoter. The reporter gene 
is linked at its 3' end to a second regulatory sequence that comprises a second 
promoter in such a fashion that tnmscriptxonal activity of the second prornocer 
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interferes with the transcriptional activity of the first promoter. This can be done, for 
example, by placing the first regulatory sequence and the second regulatory sequence 
in such a manner that transcription under the second promoter proceeds in a direction 
opposite to direction of t ran sc ript ion tinder the first promoter. 
5 The collision construct herein can be used, for example, for screening or 

identification of an inhibitor of transcriptional activity. For this use, a collision 
construct as described above is made with the promoter to be inhibited (hereafter "the 
target promoter") as the second promoter. The collision construct so made is inserted 
into a vector for expression, with or without the use of linker dements. The 

10 recombinant vector is then introduced into a compatible host cell that can effect the 
expression of the reporter gene. There are known vectors and host cells that can be 
used for these purposes, as described in greater detail below. 

The regulatory sequences suitable for use herein can be any regulatory sequence 
that is compatible for use with the promoters for expression in a desired host cell. For 

15 example, if the collision construct contains a mammalian gene promoter, a regulatory 
sequence derived tram mammalian systems would be desirable. The regulatory 
sequence can be a sequence naturally associated with the promoters selected for use 
herein, or can be a synthetic sequence, or partly synthetic or partly derived. 

The promoters suitable for use herein can be any promoter, including those that 

20 are constitutively active or those that are inducible or regulatable. The promoters can 
be naturally derived or synthetically made. They can be derived from any genes, viral, 
prokaryotic or eukaryotk. The eukaryotic genes can be yeast or other fungal, insect, 
mammalian or avian genes. In a preferred embodiment, the target promoter is derived 
from a virus or a tumor cell. Examples of suitable promoters are described below in 

25 die portion relating to expression systems. 

A suitable promoter for use as the first promoter in the present collision 
construct is one that possesses a transcriptional activity that is about the same strength 
as that of the second promoter. If the first promoter is comparatively much stronger 
* than the second promoter, inhibition of reporter gene signal by the presence of the 

30 second or target promoter may be low, and an enhanced reporter gene signal in the 
presence of an inhibitor may be difficult to detect 
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Thus, if an available promoter to be used in the collision construct as a first 
promoter is too strong, it may be desirable to weaken the promoter by deleting parts 
thereof to generate a minimal promoter. This can be done by a number of methods 
including, for example, restriction enzyme digestion. An example of a weakened 
5 promoter is one in which all but the TATA box is deleted. A promoter is considered 
too strong herein if it drives the expression of the reporter gene to about the same level 
regardless of the presence or absence of the second promoter that drives transcription 
in the opposite direction. 

When the target promoter to be inhibited is constJtutively active, a reporter 

10 gene signal expressed by a transformed host cell containing the collision construct can 
be first established in the absence of any inhibitors. A candidate inhibitor can then be 
introduced or added to the cells and the reporter gene expression can be monitored. 
Alternatively, the transformed cells containing the collision construct can be placed in 
a panel of microliter welts and a panel of candidate inhibitors can be added to the cells. 

IS one inhibitor to each well. A suitable inhibitor is one that generates an enhanced 
reporter signal in its presence as compared with the signal produced in its absence. 

The target promoter to be used herein includes promoters that are subject to 
regulation, such as activation, by the binding of a binding protein to a response 
dement in the proximity of the target promoter. The response element can be 

20 naturally present in the target promoter or can be artificially linked to the target 

promoter. Thus, the present collision construct can be used to identify an inhibitor that 
can inhibit activation of the target promoter by inhibiting binding between the binding 
protein and the response element. This can be achieved by identifying an inhibitor that 
compeddvely binds either to the binding protein or to the response dement. When die 

25 activity of the second promoter is inhibited, the reporter gene activity would return to a 
levd similar to that in the absence of activation by the binding protein. 

The binding protein can be introduced into the cells by addition thereof to the 
medium containing the cdls and gently scraping the cells from die culture dish. 
Alteraativdy , die binding protein can be provided in the form of a vector containing 

30 the coding sequence of the binding protein and regulatory sequences that would allow 
expression thereof . The vector can be introduced into the host cell either at, before, or 
after introduction of die collision construct into die cdl. In another embodiment of the 
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present invention, stable cdl lines containing a collision construct or the coding 
sequence of the binding protein can be first established, and the other sequence 
introduced later. Further alternatively, a stable cdl line containing both the collision 
construct and the binding protein can be made and thereafter used for screening 
5 inhibitors. 

- The collision construct or vector containing the collision construct can be 
introduced into host cells by conventional techniques including dectropo ration, 
calcium phosphate treatment, and tipofectamine transfection. The target promoter can 
be a known promoter with a known nucleotide sequence that can be synthetically made 

10 or derived from a natural source such as a viral gene, a tumor cell gene or a fungal 
gene, for example. Typically, such promoters are excised from the natural source and 
inserted into the collision construct by use of restriction enzymes and/or linkers. 

Alternatively, the sequence of the promoter may not be precisely known, but 
the general location of the promoter is known, for example, the promoter can be 

15 known to reside in a particular restriction fragment In this instance, the restricted 
fragment can be used as the second regulatory sequence of the present collision 
construct. 

In another embodiment of the present invention, it may be desirable to turn off 
the transcription of certain genes that are yet unidentified, for example, one responsible 

20 for production of a cancerous cell, even though the gene or genes responsible for this 
condition have not been identified. For this purpose, mRNA can be isolated from the 
tumor cdl and compared to that obtained from normal cdl fay a common procedure 
known as subtractive hybridization. By substractive hybridization it can be determined 
which mRNA is present in the tumor cdl but absent in normal cdl. A cDNA molecule 

25 can be constructed based on the mRNA so obtained, and a fragment of the genomic 
DN A containing promoter activity can be isolated and used as a target promoter in the 
present collision. 

The collision construct herein can be inserted into a suitable vector for 
introduction into a host cdl for expression and use thereof. A person skilled in the art 
30 would be able to select such a vector and host cdl for such purposes. Moreover, 
examples of suitable vectors and host cdls are described in greater detail below. 
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Thc reporter gene dm is suitable for use herein can be any reporter gene that 
can be expressed in the desired host expression system, as described previously. For 
example, the reporter gene can be 0-galactosidase, among others. 

Similarly, the response element suitable for use herein can be any response 
S element to which inhibition is desired. Examples of such response dements are as 
described above. The response element herein may be part of the promoter sequence 
by conventional techniques such as by synthesis of excision of a known sequence by 
restriction enzyme and linked to the promoter sequence with or without the use of 
linkers. 

10 The binding proteins for use herein may be any binding protein as described 

above. Such binding proteins may be added to the cells containing the collision 
construct for use in screening inhibitors. In doing so, the cells can be scraped off the 
culture dish or well and mixed with the added binding protein. 

Alternatively, die binding proteins can be introduced into the cell in the form of 

15 a vector containing the coding sequence of the binding protein and allowing the 
expression of die coding sequence. In this manner, a stable cell line containing the 
binding protein can be made and used for screening inhibitors. In another embodiment 
of the present invention, a cell that constitunveiy produces the binding protein 
constitutivdy can be used. 

20 In a further embodiment of the present invention, a stable cell line containing 

the collision construct can be made. This can be done by introduction of the collision 
construct into a host cell, by conventional techniques such as etectroporation, calcium 
phosphate treatment, and lipofectarnine or transformation, and selecting a cell or cell 
line that stably expresses the collision construct 

25 A candidate inhibitor to be tested for its inhibitory activity on a target 

promoter, or on transcriptional activity can be added to a cell harboring the collision 
construct in which die target promoter is the second promoter of the construct* and 
optimally, is desired, providing the cell also with a binding protein. Expression of 
reporter gene signal is observed and compared in the absence and in the presence of the 

30 candidate inhibitor, respectively. 

Besides testing a single candidate inhibitor, stably transformed cell lines 
containing the collision construct and optionally containing a vector containing die 
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coding sequence of a binding protein can be placed in microliter wells and a panel of 
inhibitors is added thereto. Reporter gene signals are also observed and compared in 
the absence and presence of the candidate inhibitors, respectively. 

In a further embodiment of the present invention, a method is provided for 

5 screening inhibitors, such as, for example, inhibitors to promoters and transcriptional 
activators. Promoters that can be used for screening inhibitors and used in the 
collision construct can be any desired promoters including, for example, promoters 
from viruses and cancer cells, bacteria and fungi. Transcriptional activators that can 
be inhibited can be any desired transcriptional activator including, for example. Tat, 

10 Rev, NFtcB andSpl. The region of the promoter that can be inhibited herein can be 
any region that binds transcription factors including, for example, TAR, RRE (Rev 
response element), NFkB binding site and Spl binding site. 

An embodiment of the present invention can be tailored to screen in vivo in 
cells a random library of ribozymes for those ribozymes which act as inhibitors of 

15 transcription. Ribozymes may act by catalyucally interrupting transcription by 

targeting an RNA molecule of a transcription factor that interacts with the promoter or 
by targeting the mRNA of a reporter gene. However, the use of the invention for 
screening ribozyme libraries is not limited to any theory of ribozyme function. Unlike 
inhibitors of transcription which inhibit the promoter by interfering with a promoter- 

20 transcription factor interaction, a DNA-protein interaction, ribozymes catalyucally 
disable an RNA molecule. In the context of the present invention, ribozymes which 
inhibit the second promoter in the collision construct can be selected from random 
synthetically derived ribozyme libraries by enhanced reporter gene signal, indicating 
that a ribozyme is acting to disable the second promoter, or the mRNA for a 

25 transcription factor that interacts with that promoter. 

In one embodiment of the present invention, the subunits of a collision 
construct, including a first regulatory sequence, a reporter gene, optionally, a response 
element or dements, and a second regulatory sequence, can all be obtained from 
known sources using conventional techniques of restriction enzyme digestion to remove 

30 these elements from such sources. Alternatively, these subunits can be made 

synthetically by chemical synthesis or serni-synrhericaily by isolating parts thereof from 
known sources and either combining them or by combining them and making any 
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missing parts synthetically. Once obtained or made* these submits can be linked 
together, for example, by use of known linker sequences, so as to place the reporter 
gene under regulatory control of the first regulatory sequence, with the direction of 
transcription going in one direction, 5* to 3* and the second regulatory sequence under 
S regulatory control of the response dement, with the direction of transcription by the 
second regulatory sequence running in a direction, 5' K>3\ but opposite that of the 
first regulatory sequence. Placement of the response element and the second 
regulatory sequence is such chat activation of transcription of the second regulatory 
seque n c e reduces the reporter gene signal upon transcription and translation thereof, 

10 presumably as a result of collision between the two transcription units. A mechanism 
by which the second regulatory sequence generates an anti-sense message that blocks 
translation of the reporter gene cannot be ruled out. 

The spacing between the reporter gene and the response dement can be varied 
to attain die desired levd of inhibition of reporter gene activity. In one embodiment of 

15 the present invention, the sparing between the 3' end of the reporter gene and the +1 
nucleotide of die promoter of the second regulatory sequence is less than 2200 
nucleotides. Preferably, this spacing is less than 1000 nucleotides; more preferably, it 
is less than 800 nucleotides. Most preferably, the spadng is between about 600 
nucleotides and about 20 nucleotides. In particular, spacings of about 21, 94, 153, 

20 406, and 556 base pairs are preferred. In an alternative embodiment, the target or 
second promoter of the collision construct can be optimally placed at a distance of up 
to 1500 base pairs from the 3* terminus of the first promoter. Thus, a reporter gene is 
selected that comprises a sequence that is shorter than or die same as this optimal 
distance. 

25 The first response element can also be linked to the second regulatory sequence 

using linker seqoences or the combined first response dement and second regulatory 
sequence can be removed from a known source, again by restriction enzyme digestion. 

The response dement will usually be placed at the 5* terminus of die second 
regulatory region, in accordance with the nature of most promoters which would 

30 comprise the second regulatory regions of this invention. However, for example, 
when the second regulatory region is comprised of a promoter for which it is 
appropriate to place the response dement at the 3* terminus, the response dement will 
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be most appropriaidy placed at the 3* terminus. Preferably, this response element is 
placed at its natural position in juxtaposition Co the promoter being used. For example, 
when the HIV-1 LTR promoter is used, the response element, TAR, is situated 3' to 
the + 1 nucleotide of the promoter. In other primate immunodeficiency viruses and in 
5 a subset of related nonprirnate lend viruses, the response element will also be most 
appropriately positioned at the 3* terminus of the second regulatory region. See for 
example a discussion of the characteristics of the promoters of such viruses in Cullen, 
Cell (1993) 73:417-420. The response element herein can also be rnuldrnerized to 
produce a more dramatic effect. An example of a response demerit that has been 

10 mulumerized is the [tet-oph which is an operator responsive to tetracycline induction. 

Once made, the collision construct can be introduced into an appropriate host 
cell for expressions thereof, including prokaryooc system such as bacterial, or 
eukaryotic system, such as yeast, insect cell system, or mammalian system, such as 
those described below. The binding protein may also be expressed in the expression 

IS systems described below. 

Expression in Bacterial Cells 

Control dements for use in bacteria include promoters, optionally containing 
operator sequences, and ribosome binding shes. Useful promoters include sequences 

20 derived from sugar metabolizing enzymes, such as galactose, lactose (lac) and maltose. 
Additional examples include promoter sequences derived from btosynihetic enzymes 
such as tryptophan (trp), the ^-lactamase (bla) promoter system, bacteriophage XPL, 
and T7. In addition, synthetic promoters can be used, such as the tor promoter. The 
^lactamase and lactose promoter systems are described in Chang et al.. Nature (1978) 

25 275: 615, and Goeddd et al.. Nature (19790 281: 544; the alkaline phosphatase, 
tryptophan (trp) promoter system are described in Goeddd et al. , Nucleic Acids Res. 
(1980)8: 4057 and EP 36,776 and hybrid promoters such as the foe promoter is 
described in U.S. Patent No. 4,551,433 and de Boer et aL t Proc Nad. Acad Set. 
USA (1983) 80: 21-25. However, other known bacterial promoters useful for 

30 expression of eukaryotic proteins are also suitable. A person skilled in the art would 
be able to operably ligate such p romo t er s to me coding sequences of interest, for 
example, as described in Siebenlist et al.. Cell (1980) 20. 269, using linkers or 
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adaptors to supply any required restriction sites. Promoters for use in bacterial 
systems also generally will contain a Shine-Dalgamo (SD) sequence operably linked to 
the DN A encoding the target polypeptide. For prokaryotic host cells that do not 
recognize and process the native target polypeptide signal sequence, the signal 
5 sequence can be substituted by a prokaryotic signal sequence selected, for example, 
from the group of the alkaline phosphatase, penicillinase, Ipp, or heat stable 
enterotoxin II leaders. The origin of replication from the plasmid pBR322 is suitable 
for most Gram-negative bacteria. 

The foregoing systems are particularly compatible with Escherichia coli. 

10 However, numerous other systems for use in bacterial hosts including Gram-negative 
or Gram-positive organisms such as Bacillus spp.. Streptococcus spp. t Streptomyces 
spp. t Pseudomonas species such as P. aeruginosa. Salmonella typhunurium, or 
Serratia marcescans. among others. Methods for introducing exogenous DN A into 
these hosts typically include the use of Ca02 or other agents, such as divalent cations 

IS and DMSO. DNA can also be introduced into bacterial cells by eiectroporation, 
nuclear injection, or protoplast fusion as described generally in Sambrook et a/. 
(1989), MOLECULAR CLONING: A LABORATORY MANUAL, 2d edition (Cold 
Spring Harbor Press, Cold Spring Harbor, N. Y.). These examples are illustrative 
rather than limiting. Preferably, the host cell should secrete minimal amounts of 

20 proteolytic enzymes. Alternatively, in vitro methods of cloning, e.g. , PCR or other 
nucleic acid polymerase reactions, are suitable. 

Prokaryotic cells used in this invention are cultured in suitable media, as 
described generally in Sambrook et al. (1989), MOLECULAR CLONING: A 
LABORATORY MANUAL, 2d edition (Cold Spring Harbor Press, Cold Spring 

25 Harbor, N.Y.). 

Expression in yeast cells 

Expression and transformation vectors, either extrachromosoraal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For 
30 example, expression vectors have been developed for, among others, the following 

yeasts: Saccharvnxyces cemisiae ,as described in Hinnen et aL, Proc. NatL Acad Set . 
USA (1978) 75:1929; Ito et aL. J. Bacterial (1953) 755:163; Candida albicans as 
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described in Kurtz et al., MoL Cell. Biol. (1986) 6:142; Candida maltosa. as 
described in Kunze et al., J. Banc Microbiol. (1985) 25: 141; Hansemda polymorphs 
as described in Gleeson et al., J. Gen. Microbiol (1986) /J2.3459 and Roggenkamp 
et al.. MoL Gen. Genet. (1986) 202:302); Ouyveromyces Jragilis, as described in Das 

5 et aL t J. Bacteriol. (1984) 755:1165; Kluyvcromyces lactis. as described in De 
Louvencourt et al.. J. Bacteriol. (1983) 154:731 and Van den Berg et at, 
Bio/Technology (1990) & 135; Pidtia guillerimondii, as described in Kunze et al, J. 
Basic Microbiol. (1985) 25: 141 ; Pidtia panaris, as described in Cctgg et al.. MoL 
Ceil. Biol. (1985) 5:3376 and U.S. Patent Nos. 4,837.148 and 4.929.555; 

10 Schtwsaccharomyces pombe. as described in Beach and Nurse. Nature (1981) 

J00.7O6; and Yarrowia lipolytics as described in Davidow et al.. Curr. Genet. (1985) 
70:380 and Gaillardin et al.. Curr. Genet. (1985) 70:49, Aspergillus hosts such as A. 
nidulans. as described in Ballance et al., Biochem. Biophys. Res. Commun. (1983) 
772:284-289; Tilburo et al. Gene (1983) 26:205-221 and Ydtort et al.. Proc Natl 

15 Acad. Sci. USA (1984) 81: 1470-1474. and A. niger. as described in Kelly and Hynes. 
EMBOJ. (1985) 4:475479; Tnchoderma reesia. as described in EP 0 244 234. and 
filamentous fungi such as. e.g, Neurospora. Penicillium. Tofypodadium. as described 
in WO 91/00357. 

Control sequences for yeast vectors are known and include promoters regions 
20 from genes such as alcohol dehydrogenase (ADH), as described in EP 0 284 044. 
enolase, glucokinase, glucose-6-phosphate isornerase, glycerakiehyde~3-pho$phate- 
dehydrogenase (GAP or GAPDH), hexokirtase, phosphofruttokinase, 3- 
phosphoglycerate mutase, and pyruvate kinase (PyK), as described in EP 0 329 203. 
The yeast PH05 gene, encoding acid phosphatase, also provides useful promoter 
25 sequences, as described in Myanohara etaL, Proc. Natl Acad. Sd. USA (1983) £0:1. 
Other suitable promoter sequences for use with yeast hosts include the promoters for 3- 
pbosphoglycerate kinase, as described in Hitzeman cial., J. Biol. Chan. (1980) 
255:2073, or other glycolytic enzymes, such as pyruvate decarboxylase, 
triosephosphate isornerase, and phosphoglucose isornerase, as described in Hess etaL, 
30 /. Adv. Enzyme Reg. (1968) 7:149 and Holland etaL. Biochemistry (1978; 77:4900. 
Inducible yeast promoters having the additional advantage of transcription controlled 
by growth conditions, include those from the fist above and others including the 



WO 97/10360 PCT/US96/13845 

-20- 

promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, 
degradative enzymes associated with nitrogen metabolism, metallothionein, 
glyceraldehyde-3-phosphaie dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Suitable vectors and promoters for use in yeast expression are 
5 further described in Hitzemarv EP 0 073 657. Yeast enhancers also are 

advantageously used with yeast promoters. In addition, synthetic promoters which do 
not occur in nature also function as yeast promoters. For example, upstream activating 
sequences (U AS) of one yeast promoter may be joined with the transcription activation 
region of another yeast promoter, creating a synthetic hybrid promoter. Examples of 

!0 such hybrid promoters include the ADH regulatory sequence linked to the GAP 
transcription activation region, as described in U.S. Patent Nos. 4,876,197 aid 
4,880,734. Other examples of hybrid promoters include promoters which consist of 
the regulatory sequences of either the ADH2, GALA, GAL10 % or PH05 genes, 
combined with the transcriptional activation region of a glycolytic enzyme gene such as 

IS GAP or PyK, as described in EP 0 164 556. Furthermore, a yeast promoter can 

include naturally occurring promoters of non-yeast origin that have the ability to bind 
yeast RNA polymerase and initiate transcription. 

Other control elements which may be included in the yeast expression vectors 
are terminators, for example, from GAPDH and from the enolase gene, as described in 

20 Holland etaL, J. Biol Chan. (1981) 256: 1385, and leader sequences which encode 

♦ 

signal sequences for secretion. DNA encoding suitable signal sequences can be 
derived from genes for secreted yeast proteins, such as the yeast invertase gene as 
described in EP 0 012 873 and JP 62,096,086 and the a-f actor gene, as described in 
U.S. Patent Nos. 4,588,684, 4,546,083 and 4,870,008 and EP 0 324 274 and WO 

25 89/02463. Alternatively, leaders of non-yeast origin, such as an interferon leader, also 
provide for secretion in yeast, as described in EP 0 060 057. 

Methods of introducing exogenous DNA into yeast hosts are well known in the 
art, and typically include either the transformation of spheroplasts or of intact yeast 
cells treated with alkali cations. Transronnations into yeast can be carried out 

30 according to the method described in Van Soltngen a aL, J. BacL (1977) 730:946 and 
Hsiao ft al. t Proc NaiL Acad. Set USA (1979) 76:3829. However, other methods for 
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imroducing DNA into cells such as by nuclear injection, dectropo ration, or protoplast 
fusion may also be used as described generally in Sambrook et at., cited above. 

For yeast secretion the native target polypeptide signal sequence may be 
substituted by the yeast invertase, a-factor, or acid phosphatase leaders. The origin of 
5 replication from the 2 u piasmid origin is suitable for yeast A suitable selection gene 
for use in yeast is the up\ gene present in the yeast piasmid described in Kingsman a 
al.. Gene (1979) 7: 141 or Tschemper et at.. Gene (1980) /0.157. The trpl gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in 
tryptophan. Similarly, Leu2-deffcieni yeast strains (ATOC 20,622 or 38,626) are 
10 complemented by known plasmids bearing the Leu2 gene. 

For intracellular production of the present polypeptides in yeast, a se qu ence 
encoding a yeast protein can be linked to a coding sequence of the desired polypeptide 
to produce a fusion protein that can be cleaved intracellularly by the yeast cells upon 
expression. An example, of such a yeast leader sequence is the yeast ubiquitin gene. 

15 

Expression in Insect Cells 

Baculovirus expression vectors (BEVs) are recombinant insect viruses in which 
the coding sequence for a foreign gene to be expressed is inserted behind a baculovirus 
promoter in place of a viral gene, e.g., polybedrin, as described in Smith and 

20 Summers, U.S. Pat. No., 4,745,051. 

An expression construct herein includes a DNA vector useful as an intermediate 
for the infection or transformation of an insect cell system, the vector generally 
containing DNA coding for a baculovirus transcriptional promoter, optionally but 
preferably, followed downstream by an insect signal DNA sequence capable of 

25 directing secretion of a desired protein, and a ate for insertion of the foreign gene 
encoding the foreign protein, the signal DNA sequence and the foreign gene being 
placed under the tra nsc rip ti onal control of a baculovirus promoter, the foreign gene 
herein being the coding sequence of the desired polypeptide. 

The promoter for use herein can be a baculovirus transc rip tional promoter 

30 region derived from any of die over 500 bacuioviruses generally infecting insects, such 
as, for example, the Orders Lepidoptera, Diptera, Orthoptera, Coleoptera and 
Hymenoptera including, for example, but not limited to the viral DN As cfAutogrupho 
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caUfornica MNPV, Bombyx ami NPV, rrichoplusia ni MNPV, Rachlptusia ou MNPV 
or Galleria melbnetta MNPV, itafer aegypti. Drosophila mdanogaster, Spodoptera 
frugiperda, and Trichoplusia ai. Thus, the baculovirus t ran s cripti onal promoter can be, 
for example, a baculovirus iinmediate-eariy gene IE1 or I EN promoter; an immediate- 
5 early gene in combination with a baculovims ddayed-early gene promoter region 
selected from the group consisting of a 39K and a Hindlll fragment containing a 
ddaycd-eariy gene; or a baculovirus late gene promoter. The immediate-early or 
delayed-earry promoters can be e nhan ced with transcriptional enhancer elements. 
Particularly suitable for use herein is the strong polyhedrin promoter of the 

10 baculovims, which directs a high level of expression of a DN A. insert, as described in 
Friesen et at. (1986) The Regulation of Baculovirus Gene Expression" in: THE 
MOLECULAR BIOLOGY OF BACULOVIRUSES (W. Doerfler, ed ); EP 0 127 839 
and EP 0 155 476; and the promoter from die gene encoding the plO protein, as 
described in Vlak et al.. J. Gen. ViroL (1988) $5*765 -776. 

15 The plasmid for use herein usually also contains the polyhedrin polyadenylation 

signal, as described in Miller et al., Ann. Rev. Microbiol. (1988) 42:177 and a 
procaryotic ampiciltin-resistance (amp) gene and an origin of replication for selection 
and propagation in E. coli. DNA encoding suitable signal sequences can also be 
included and is generally derived from genes for secreted insect or baculovirus 

20 proteins, such as the baculovirus polyhedrin gene, as described in Carbonell et al., 
Gene (1988) 75:409, as well as mammalian signal sequences such as those derived 
from genes encoding human a-interfcroa as described in Maeda et al.. Nature (1985) 
525:592-594; human gastrin-rekasing peptide, as described in Lebacq-Verheyden et 
of., Mol Cell. Biol. (1988) 8:3129; human IU2, as described in Smith et al. Proc. 

25 NatL Acad. So. USA (1985) 82:8404; mouse IL-3, as described in Mryajima et al., 
Gene (1987) 55:273; and human ghjcocerebrosidase, as described in Martin et al., 
DNA (1988) 7.-99. 

Numerous bacukmral strains and variants and corresponding permissive insect 
host cells from hosts such as Spodoptera frugiperda (caterpillar), Aedes aegypti 
30 (mosquito), Aedes atbopictus (mosquito), Drosophila melanogaster (fraitfty), and 

Bomby* mori host cells have been identi tied and can be used herein. See, for example, 
the description in Luckow et at, Bk*Technalogy(l98&) (£47-55, Miller etaL. in 
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GENETIC ENGINEERING (Setiow. J.K. et aL eds.), Vol. 8 (Plenum Publishing, 
1986), pp. 277-279, and Maeda a al.. Nature. (1985) 575:592-594. A variety of such 
viral strains are publicly available, eg., the H variant of Autographa California: 
NPV and the Bn>5 strain of Bcmbyxmori NPV. Such viruses may be used as the 
5 virus for transfection of host cells such as Spodoptem frugiperda cells. 

Other baculovirus genes in addition to the polyhedrin promoter may be 
employed to advantage in a baculovirus expression system. These include immediate- 
early (alpha), delayed-eariy (beta), late (gamma), or very late (delta), according to the 
phase of the viral infection during which they are expressed. The expression of these 

10 penes occurs sequentially, probably as the result of a "cascade" mechanism of 

transcriptional regulation. Thus, the immediate-early genes are expressed immediately 
after infection, in the absence of other viral functions, and one or more of the resulting 
gene products induces transcription of the ddayed-eariy genes. Some ddayed-early 
gene products, in turn, induce transcription of late genes, and finally, the very late 

15 genes are expressed under the control of previously expressed gene products from one 
or more of the earlier classes. One relatively well defined component of this 
regulatory cascade is IE1, a preferred immediate-early gene of Autographo californicc 
nuclear polyhedrosis virus (AcMNPV). IEI is expressed in the absence of other viral 
functions and encodes a product that stimulates the transcription of several genes of the 

20 delayed-eariy class, including the preferred 39K gene, as described in Guarino and 
Summers, /. Viral. (1986) 57.563-571 and /. Virol. (1987) 61:2091-2099 as well as 
late genes, as described in Guarino and Summers, Virol. (1988) 762:444-451. 

Immediate-early genes as described above can be used in combination with a 
baculovirus gene promoter region of the delayed-eariy category. Unlike the 

25 immediate-early genes, such ddayed-earry genes require the presence of other viral 
genes or gene products such as those of the immediate-tarty genes. The combination 
of imrnediate-early genes can be made with any of several delayed-eariy gene promoter 
regions such as 39K or one of the delayed-eariy gene pr omoters found on the Hindlll 
fragment of the baculovirus genome In the present instance, the 39K promoter region 

30 can be linked to the foreign gene to be expressed such that expression can be further 
controlled by the presence of IEI, as described in L. A. Guarino and Summers 
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(1986a), cited above; Guarino A Summers (1986b) J. ViroL. (1986) 60:215-223, and 
Guarino a al. (1986c). J. ViroL (1986) 60:224-229. 

Additionally, when a combination of immediate-early genes with a delayed- 
early gene promoter region is used, enhancement of the expression of heterologous 
5 genes can be realized by the presence of an enhancer sequence in direct cis linkage 
with the delayed-early gene promoter region. Such enhancer sequences are 
characterized by their enhancement of delayed-early gene expression in situations 
where the immediate-early gene or its product is limited. For example, the hr5 
enhancer sequence can be linked directly, in cis, to the delayed-early gene promoter 

10 region, 39K, thereby enhancing the expression of the cloned heterologous DNA as 
described in Guarino and Summers (1986a), (1986b). and Guarino a al. (1986). 

The polyhedrin gene is classified as a very late gene. Therefore, transcription 
from the polyhedrin promoter requires the previous expression of an unknown, but 
probably large number of other viral and cellular gene products. Because of this 

15 delayed expression of the polyhedrin promoter, staie-of-the-an BEVs, such as the 
exemplary BEV system described by Smith and Summers in, for example, U.S. Pat. 
No., 4,745,051 will express foreign genes only as a result of gene expression from the 
rest of the viral genome, and only after the viral infection is well underway. This 
represents a limitation to the use of existing BEVs. The ability of the host cdl to 

20 process newly synthesized proteins decreases as the baculovirus infection progresses. 
Thus, gene expression from the polyhedrin promoter occurs at a time when the host 
cell's ability to process newly synthesized proteins is potentially diminished for certain 
proteins. As a consequence, the expression of secretory glycoproteins in BEV systems 
is complicated due to incomplete secretion of the cloned gene product* thereby trapping 

25 the cloned gene product within the cell in an incompletely processed form. 

While it has been recognized that an insect signal sequence can be used to 
express a foreign protein that can be cleaved to produce a mature protein, the present 
invention is preferably practiced with a mammalian signal sequence. 

An exemplary insect signal sequence suitable herein is the sequence encoding 

30 for a Lcpidoptcran adipokinetic hormone (AKH) peptide. The AKH family consists of 
short blocked neuropeptides that regulate energy substrate mobilization and metabolism 
an insects. In a preferred embodiment, a DNA sequence coding for a Lepidopteran 
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Manduca sexta AKH signal peptide can be used. Other insect AKH signal peptides, 
such as those from the Orthoptera Sddstocerca grtgaria locus can also be employed to 
advantage. Another exemplary insect signal sequence is the sequence coding for 
Drosophila cuticle proteins such as CPI, CP2, CPS or CP4. 
5 Currently, the most commonly used transfer vector thai can be used herein for 

introducing foreign genes into AcNPV is pAc373. Many other vectors, known to 
those of skill in the art* can also be used herein. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in a kit form 
from companies such as lnvitrogen (San Diego CA) ("MaxBac" kit). The techniques 

10 utilized herein are generally known to those skilled in the an and are fully described in 
Summers and Smith. A MANUAL OF METHODS FOR BACULO VIRUS VECTORS 
AND INSECT CELL CULTURE PROCEDURES. Texas Agricultural Experiment 
Station Bulletin No. 1555, Texas A&M University (1987); Smith et aL, Moi Cell. 
Biol. (1983) 3:2156, and Luckow and Summers (1989). These include, for example, 

15 the use of pVL985 which alters the polyhedrin start codon from ATG to ATT, and 
which introduces a BamHl cloning site 32 base pairs downstream from the ATT, as 
described in Luckow and Summers, Virology (1989) /7:3l. 

Thus, for example, for insect cell expression of the present polypeptides, the 
desired DNA sequence can be inserted into the transfer vector, using known 

20 techniques. An insect cell host can be cotransformed with the transfer vector 

containing the inserted desired DNA together with the genomic DNA of wild type 
baculoviros, usually by cotransfectton. The vector and viral genome are allowed to 
recombine resulting in a recombinant virus that can be easily identified and purified. 
The packaged recombinant virus can be used to infect insect host cells to express the 

25 desired polypeptide. 

Other methods that arc applicable herein are the standard methods of insect cell 
culture, cotransfection and preparation of plasmids are set forth in Summers and Smith 
(1987), cited above. This reference also pertains CD the standard methods of cloning 
genes into AcMNPV transfer vectors, plasmid DNA isolation, transferring genes into 

30 the AcmMNPV genome, viral DNA purification, radioUbeling recombinant proteins 
and preparation of insect cell culture media. The procedure for the cultivation of 
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vi ruses and cells are described in Votkman and Summers, /. Virol. (1975) 79:820-332 
and Volkrnan, et al.. J. YmL ( 1976) 7*820*32. 

Expression in Mammalian Cells 
5 The polypeptides of the present invention can be expressed in mammalian cells, 

such as HeLa cells, using promoters and enhancers that are functional in those cells. 
Synthetic non-natural promoters or hybrid promoters can also be used herein. For 
example, a T7T7/T7 gene promoter can be constructed and used, in accordance with 
Chen et al.. Nucleic Adds Res. 22:2114-2120 (1994), where the T7 polymerase is 

10 under the regulatory control of its own promoter and drives the transcription of the 
inserted coding seque n ce, which is placed under the control of another 17 promoter. 
Also suitable for use herein is the gene for the (XAAT/etihartcer-buiding protein 
C/EBPa, as described in Birkenmeier et al.. Genes Dev. (1989) J:l 146-1156. 

Typical promoters for mammalian cell expression include the SV40 early 

15 promoter, the CMV promoter, the mouse mammary tumor virus LTR promoter, the 
adenovirus major late promoter (Ad MLP), and the herpes simplex virus promoter, 
among others. Other non-viral promoters, such as a promoter derived from the murine 
metallothionein gene, will also find use in mammalian constructs. Mammalian expression 
may be either constitutive or regulated (inducible), depending on the promoter. Typically, 

20 transcription termination and polyadenylatioo sequences will also be present, located 3' to 
the translation stop codon Preferably a sequence for optimization of initiation of 
translation, located 5* to the polypeptide coding sequence, is also present. Examples of 
transcription terminator/ polyadenytation signals include those derived from SV40, as 
described in Sambrook et ai (1989), ched rxeviousty. barons, containing splice donor 

25 and acceptor sites, may also be designed into the constructs of the present invention. 

Enhancer elements can also be used herein to increase expression levels of the 
mammalian constructs. Examples indude the SV40 carry gene enhancer, as described in 
Dykemaero/., EMBO J. (1985) 4:761 and the eriharra/promrjter derived from the 
long terminal repeat (LTR) of the Rous Sarcoma Virus, as described in Gorman etaL % 

30 Proc Natl. Acad. ScL USA (1982b) 7*6777 and human cytomegalovirus, as 

described in Bosharte/tfi. Cell (1985) 47:521. A leader sequence can also be present 
which includes a sequence encoding a signal peptide, to provide for the secretion of the 
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foreign protein in mammalian cells. Preferably, there are processing sites encoded 
between the leader fragment and the gene of interest such that the leader sequence can 
be cleaved either in vivo or in vitro. The adenovirus tripartite leader is an example of 
a leader sequence that provides for secretion of a foreign protein in mammalian cells. 
5 There exist expression vectors that provide for the transient expression in 

mammalian cells of DNA encoding the target polypeptide. In general, transient 
expression involves the use of an expression vector that is able to replicate efficiently 
in a host cell, such that the host cell accumulates many copies of the expression vector 
and, in turn, synthesizes high levels of a desired polypeptide encoded by die expression 

10 vector. Transient expression systems, comprising a suitable expression vector and a 
host cell, allow for the convenient positive identification of polypeptides encoded by 
cloned DNAs, as well as for the rapid screening of such polypeptides for desired 
biological or physiological properties. Thus, transient expression systems are 
particularly useful for purposes of identifying analogs and variants of the target 

15 polypeptide that have target polypeptidc-Iikc activity. 

Once complete, the mammalian expression vectors can be used to transform any 
of several mammalian cells. Methods for introduction of heterologous polynucleotides 
into mammalian cells are known in the art and include dextran-mediated transection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 

20 dectropo ration, encapsulation of the polynucleotides) in liposomes, and direct 
microinjection of the DNA into nuclei. General aspects of mammalian cell host 
system transformations have been described by Axel in U.S. Patent No. 4,399,216. 

Mammalian cell lines available as hosts for expression are also known and 
include many immortalized cell lines available from the American Type Culture 

25 Collection (ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, 
HeLa ceils, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human 
hepatocellular carcinoma cells (e.g.. Hep G2), human embryonic kidney cells, baby 
hamster kidney cells, mouse Sertoli ceils, canine kidney ceils, buffalo rat liver ceils, 
human lung cells, human liver ceils, mouse mammary tumor cells, as well as others. 

30 The mammalian host ceils used to produce the target polypeptide of this 

invention may be cultured in a variety of media. Commercially available media such 
as Ham s FiO (Sigma), Minimal Essential Medium ([MEM]. Sigma), RPMM640 



WO97/1U60 PCT/US96/13845 

-28- 

(Sigma), and Duibecco's Modified Eagle's Medium ([DM EMI, Sigma) are suitable for 
culairing the host cells. In addition, any of the media described in Ham and Wallace* 
Metk Em. (1979) 55:44, Barnes and Sato. AnaL Biochem. (1980) 102:255. U.S. 
Patent Nos. 4 f 767, 704, 4,657,866, 4,927,762, or 4.560.655. WO 90/103430. WO 
5 87/00195, and U.S. RE 30,985, may be used as culture media for the host ceils. Any 
of these media may be supplemented as necessary with hormones and/or other growth 
factors such as insulin, transferrin, or epidermal growth factor, salts (such as sodium 
chloride, calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides 
(such as adenosine and thymidine), antibiotics (such as Gemamycin™ M drug), trace 

10 dements (defined as inorganic compounds usually present at final concentrations in the 
microtnolar range), and glucose or an equivalent energy source. Any other necessary 
supplements may also be included at appropriate concentrations that would be known 
to those skilled in the an. The culture conditions, such as temperature, pH, and the 
like, are those previously used with the host cell selected for expression, and will be 

15 apparent to the ordinarily skilled artisan. 

The collision construct can be introduced into host cells by conventional 
techniques including lipofectamine, DEA&dexiran, eiectropo ration, and calcium 
phosphate, and as described above. 

For use for in screening inhibitors, a stable cell line that contains the collision 

20 construct can be made and selected. For example, the collision construct is 

electroporatcd together with a selectable marker gene, for example neomycin. G418 
resistant colonies are assayed for the existence and functionality of the collision 
construct The cell line can be prokaryotic or eukaryotic in origin. Preferably, the 
cell line is eukaryotk, more preferably, inammalian. In a preferred embodiment, the 

25 cdl line can be derived from HeLa cells, T-cdls, B-cells and 293 cells. 

The stable cell line containing the collision construct can also be cotransfected 
with a plasmid containing a binding protein. The coding sequence for the binding 
protein can be inserted into an expression plasmid, such as pCG, a pEVRF derivative, 
described in Giese et al., Gates & Development (1995) £995-1008. pEVRFis 

30 described in Matthias et aL, Nudeic Adds Res. (1989) 77:6418. pCG has a modified 
polylinker. and directs expression in rnammalian cells from die human cytomegalovirus 
promoter/enhancer region. The coding sequence for the binding protein can also be 
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inserted into the expression plasmd pCDNA (Ooniech, Palo Alto, CA) The DNA 
construct encoding the binding protein can be made by standard methods of 
recombinant DNA technology as described in Sambrook et at. (1989) MOLECULAR 
CLONING: A LABORATORY MANUAL, 2nd ed. (Cold Spring Harbor Press. Cold 
5 Spring Harbor, N Y.) and Ausubel et al„ cited previously. 

Alternatively, the collision construct can be transfected into a cdl line that 
consdtutively expresses a binding protein, such as, for example, HeLa cells, T-cdls, 
B-cells or 293 cells that have been stably transfected with a vector that directs 
expression of the binding protein. 

10 Further alternatively, the host cell carrying the collision construct for use in 

screening can be exposed to a binding protein that is added into the medium containing 
the transfected cells. In a preferred embodiment, a Tat protein expression vector is 
added to a cell line carrying the collision construct The amount of the expression 
vector can be varied depending upon the extent of inhibition of reporter gene activity 

15 desired. It is desirable to work in the range of about 50% to 90% AP activity, 
preferably 60% to 80% AP activity in the absence of activation of the second 
regulatory sequence and in the range of about 10% to 50%, preferably 20% to 40% 
AP activity, in the presence of activation of the second regulatory sequence. 

Thus, in using the host cell that contains the collision construct for screening 

20 inhibitors, the reporter gene activity is determined in die absence of die binding 
protein, in the presence of the binding protein, and in the presence of candidate 
inhibitors being screened. A candidate that increases reporter gene activity in die 
presence of a binding protein that activates transcription, for example, can be selected 
and further tested as an inhibitor to transcriptional activation. 

25 In another embodiment of the present invention, kits can be made that contain 

the present collision construct for screening for inhibitors of transcriptional activation. 
Such kits can include vectors or host cells containing one or more of the present 
collision constructs in suitable containers, along with the reagents and materials 
required for the conduct of the assay or descriptions of those remaining reagents 

30 necessary, as well as a suitable set of assay instructions. Other materials or reagents 
. can include, for example, diluents, buffers, host cells and other reagents, appropriate 
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containers such as tubes, plates, etc., and may be included in the kit, or described in 
the instructions. 

The present invention will now be illustrated by reference to the following 
examples which set forth particularly advantageous embodiments. However, it should 
S be noted that these embodiments are illustrative and are not to be construed as 
restricting the invention in any way. In particular, other promoters or response 
elements and other reporter gene can be substituted for the ones described herein. 

Example 1 

10 Construction of the Collision Construct with CMV and HFV-1 Promoters 

and an Alkaline Phosphatase Reporter Gene: Construct #1 152 
In one embodiment of the present invention, the collision construct containing 
the human cytomegalovirus ChCMV") promoter, the gene for the secreted form of the 
human placental heat-stable alkaline phosphatase ("AP") and the promoter of the 

15 human immunodeficiency virus- 1 (*HI V-l was generated from precursor constructs 
as described below. 

A nucleotide sequence comprising the hCMV promoter and a region derived 
from the herpes simplex thymidine kinase, cfc, gene for the optimal initiation of 
translation (hereafter "the tk upstream region"), was isolated from plasmid pCG. 

20 Plasmid pCG, described in Giese et al.. Genes and Development (1995) £995-1008, is 
a pEVRF derivative, as described in Matthias et al.. cited previously. pCG has a 
modified polylinker, and directs expression in mammalian cells from the human 
cytomegalovirus promoter/enhancer region. The hCMV promoter was isolated from 
pCG by digestion with restriction enzymes EcoRl and Xbal (Boehringer Mannheim, 

25 Germany). Restriction digestion for purposes herein was conducted essentially as 

described in Sambrook et al., cited previously, and Ausubel et at, cited previously, or 
in accordance with the manufacturer's recotnmendations. For example, digestions 
were typically conducted using 2 pi of 10 x restriction buffer, 0 .1 to 4 ug of DNA in 
water or TE buffer, 1 -5 U of enzyme per pg of DNA and water to obtain a total 

30 volume of 20 pi. The components of the digest were incubated at 37°C from about 10 
minutes to overnight, depending on the amount of DNA being digested. 
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The hCMV promoter sequence isolated from pCG was then iigated into plasmid 
pTZ19U, purchased from Pharmacia (Piscataway, N.J.)* that had been cleaved with 
the same restriction enzymes, EcoRl and Xbal. The resulting plasmid was designated 
construct #1080. Ligation reactions herein were essentially performed as directed by 
5 the manufacturer of the ligase (Boehringer Mannheim, Germany) and along the 
principles described in Sambrook et aL t cited previously, and Ausubd ct of., cited 
previously. Briefly, approximately 10 to 100 fempco motes (10" 13 ) of vector DNA 
were Hgated with 3 to 10 fold molar excess of insert DNA in a final volume of 20 pi 
using T4 DNA ligase (Boehringer Mannheim, Germany) at 16°C from about 10 

10 minutes to overnight, depending on the amount of DNA being ligatcd. 

The coding region of the alkaline phosphatase gene was isolated from plasmid 
pSEAP-Basic, purchased from Qontech (Palo Alto, CA) by restriction with Hin&XU 
and Sail and Iigated into plasmid Bluescript, purchased from Stratagene (La Jolla, CA) 
that had been cleaved with the same enzymes. The resulting plasmid, designated 

IS construct #1067, was cleaved with Qal and Safl, the 5 '-overhangs were filled in by 

« 

Klenow enzyme, (Boehringer Mannheim, Germany) and the ends were religated. Fill- 
in reactions described herein were conducted as directed by the manufacturer of the 
Klenow enzyme (Boehringer Mannheim, Germany). The resulting plasmid construct 
was designated #1074. This manipulation restored the Sail restriction site. 
20 The coding region of the alkaline phosphatase gene was isolated from construct 

#1074 by restriction with Xbal and Sail and Iigated into construct #1080 that had been 
cleaved with the same enzymes. This ligation resulted in an intermediate recombinant 
plasmid containing an AP gene that was out-of-frame with respect to the ft regi 
For production of an in-frame fusion, the intermediate recombinant plasmid containing 
25 the AP sequence was cleaved with Xbal and Hindi U, the S*-overhangs were fiUed-in 
and the ends were religated. This manipulation also restored the Xbal site. The 
resulting plasmid was designated construct #1112. 

The HIV-1 promoter was isolated from plasmid pHIVSCAT, as described in 
Selby and Peteriin, Cell (1990) 6*2:769-776, by treatment with Aspl\% and HindSR. 
30 The isolated fragment was Iigated into plasmid Bluescript from Stratagene (La Jolla, 
CA), that had been cleaved with the same enzymes. The resulting plasmid was 

met #1075. The particular HIV-1 promoter in pHIVSCAT contains 
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several point mutations that have been iiuroduced to create restriction enzyme 
recognition sites. Comparison of the mutant promoter in pHlVSCAT with the wild- 
type HIV-1 promoter did not show airy significant differences in activity. The 
sequence of the mutant HIV-1 promoter is listed in FIG. 5. The HIV-1 promoter was 

5 isolated from construct #1075 by restriction with £coRV and then ligated into plasmid 
pTZl8U from Pharmacia (Piscataway, NJ), that had been cleaved with Smal. The 
resulting plasmid was designated construct #1149. 

To generate the final collision construct of this example, the HIV-1 promoter 
was isolated from construct #1149 by restriction with Sail and AspllS and ligated into 

10 construct #1112 that was cleaved with the same enzymes. The resulting plasmid was 
designated construct #1 152, the collision construct. 

In the collision construct, the direction of transcription from the hCMV and mat 
from the HIV-1 promoter were in opposite directions. The distance between the end of 
the AP coding region, as defined by the stop codon TAA, and the start of transcription 

15 in the HIV-1 promoter, as defined by + 1 of the promoter sequence was about 213 
nucleotides. 

Example 2 

HIV Tat Protein Dependent Reduction of Alkaline Phosphatase Activity 
20 HeLa cells were transiently transfected with the following: (1) 1 ug of plasmid 

#1 152, the collision construct, and (2) various amounts of plasmid pSWfdVTAT: 0 
ug, 0. 1 ug, 0.3 ug, 0.5 ug, and 1.5 ug, respectively. Plasmid pSV7fdVTAT is herein 
referred to as the Tat expression plasmid, and its construction is described below. The 
amount of DN A in each transf ection assay was kept constant by adding Tat-toacrive 
25 plasmid, the construction of which is also described below. Results are shown in FIG. 
1. For transient transection of HeLa cells described herein, lirjofectamine (purchased 
from BRL, Gaithersburg, MD) was used in accordance lo the manufacturers* 
instructions. For uansfections hereafter, except as expressly provided otherwise, 1 ug 
of the collision construct was used together with either 0.5 ug of Tat expression 
30 plasmid (pSV7fd/TAT) and/or 0.5 ug of Tat-Inacrive plasrnkf. After approximately S 
hours, the cells were washed and incubated in fresh DME medium, supplemented with 
10% fetal calf serum. About 16-20 hours after transection, aliquoo of the supernatant 
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were analyzed for alkaline phosphatase activity according to the manufacturers* 
conditions (Clontech, Palo Alto. CA). 

The Tat expression plasraid and the Tat-wactive plasmid were consmicted or 
used as follows. Plasmid pSV7fd/TAT was obtained from Peteriin at University of 
5 California at San Francisco and was constructed and used as described in Sdby and 
Peteriin, Cdl (1990) 62:769-776. Plasmid pS V7fd/TAT contains the coding region 
for the transcriptional activator Tat from HIV-1. In this plasmid, Tat expression is 
tinder control of the SV40 early promoter. Inactive Tat expression plasmid was 
generated by restriction of plasmid pSV7fd/TAT with Xbal. The DNA ends were 

10 ' fitted-in and the plasmid reiigated. this procedure generated a frame shift mutant that 
resulted in a premature stop codon and no functional Tat protein expression. The 
resulting construct is referred to as Tat-inactive plasmid, and was used herein to keep 
the total amount of DNA in each transfection constant. 

FIG. 1 shows a reduction of AP activity that was dependent on the amount of 

IS Tat expression plasmid added. Over the range of 0 to 2 fig of Tat expression plasmid, 
reporter gene activity decreased from about 100% to about 25%, resulting in about 
75% inhibition at the highest level tested. Reporter activity was about 40% when Tat 
was present at a level of between about 0.3 to 0.5 \xg of Tat expression vector. At this 
level, there is almost no nonspecific effect of Tat protein on CMV promoter activity. 

20 

Example 3 

s Study of the Dependence of TAR on Tat for Reduction in AP Activity Using Deletion 

Constructs Derived from Construct #11 52 

To examine the dependence of the presence of the TAR sequence in the HIV-1 
25 promoter for the observed Tat-dependent reduction in AP activity in the previous 

example, the following deletions were introduced into construct #1 152: (1) deletion of 
a major portion of the TAR sequence, construct #1 161; (2) deletion of the entire TAR 
sequence, construct #1225; (3) deletion of the entire TAR sequence and the TATA 
box, construct # 1 1 62; and (4) deletion of the entire TAR sequence, the TATA box, 
30 and the three Spl binding sites, construct #1163. 
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Construct #1 152 was restricted with BglU and AzmHI and the ends rctigated. 
The resulting plasmid represents construct #1161. This deletion removes about 36 
nucleotides of the TAR sequence. 

Complete removal of the TAR sequence was obtained by deleting nucleotides 
5 from position -14 to +59, by PGR using DNA of construct #1152 as template and 
primer #613 (S'-GCGAAGC^ T^GCAGCTGC^TATATGCAGCA> 3 ^ ) and reverse 
primer, purchased from New England Btolabs, Beverly, MA. The underlined 
sequence represents nucleotides -35 to -13 of the HIV- 1 promoter beginning with 
HindlU restriction site; the non-underlined portion includes the HindlU restriction site 
10 that begins after the nucleotides GCG at the 5'-end. The resulting DNA fragment was 
digested with HindlU and Asp718 and religaxed into construct #1 152 cleaved with the 
same enzymes. This manipulation also removed the start of transcription and 
generated construct #1225. 

A third deletion construct was prepared from construct #1 152 by restriction 
15 with Xbal and religanon. The resulting plasmid is designated construct #1 162. This 
deletion removes the complete TAR sequence and the region containing the TATA box 
sequence. 

A fourth deletion construct was prepared from construct #1 1 52 by restriction 
with Smal and BamHl and the ends reiigated. The resulting plasmid is designated 

20 construct #1163. This deletion removes the complete TAR sequence, the TATA box 
sequence and the three Spl binding-site s eq u ences. 

FIG. 2 shows the results of measuring the specific reduction of alkaline 
phosphatase expression in the presence of 0.5 ug HIV-1 Tat protein expression plasmid 
in the deletion constructs #1 161 . #1225, #1 162 and #1 163. as compared with construct 

25 #1152. For construct #1152, in the absence of Tat, activity of the AP gene was set to 
100%. in the presence of Tat* activity of the AP gene was reduced to 36% ± 7%. 
Thus, in the presence of Tat, expression of AP was reduced by about 64% . For 
construct #1161, in which most the TAR sequence was dektcd, expression of the AP 
gene in the absence of Tat was about 55%. Addition of Tat only reduced AP gene 

30 expression to about 53%, indicating that mhibition of reporter gene activity by Tat 
protein is TAR sequence dependent. Construct #1225, containing a deletion in the 
TAR sequence, produced about 150% AP gene activity in the absence of Tat, and 
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about 140% AP gene activity in the presence of Tat. Construct #1 162, containing a 
deletion in the TAR sequence and in the TATA box, produced about 150% AP gene 
expression in the absence of Tat protein, and about 153% AP gene expression in the 
presence of Tat Construct #1 163, containing deletions in the TAR sequence, the 
5 TATA sequence and the Spl binding sites, generated a still higher level of AP gene 
expression of about 160% in the absence of Tat, and about 150% in the presence of 
Tat These results demonstrate the requirement for a functional TAR sequence and a 
TATA box sequence for activation of the HIV-1 promoter. In addition, significant 
upregulation of HIV-1 promoter activity by Tat protein requires a functional TAR 
10 sequence. 

Example 4 

Effect of Various Deletions of the HIV-1 Promoter on Transcriptional Activation and 

Construction of TAT-lnactive Plasmid 

15 Other constructs were made to test the effect of deletion of portions of a 

promoter region on transcriptional activation, using the AP gene as the reporter gene. 
Construct #1085 was made using the mutant HIV-1 promoter as described in Example 
3, as an Asp718IHindIlI DNA fragment isolated from pHIVSTAT and ligated into 
plasmid pSEAP-Bask (Qontech, Palo Alto, CA) that was cleaved with the same 

20 enzymes. This operation linked the HIV-1 promoter to the AP gene. Constructs 

#1166, #1213, #1167 and #1168 were made by isolating the HIV-1 promoter dderion 
constructs from construct #1161, #1215, #1162 and #1163, described above, and 
iigarJng them as Asp718/HUuiIII fragments into plasmid pSEAP-Bask. Thus, construct 
#1085 contains the entire HIV-1 promoter. Construct #1 166 lacks about 36 

25 nucleotides of the TAR sequence. Construct #1213 lacks all of the TAR sequence and 
nucleotides comprising the original start of transcription. Construct #1167 lacks the 
TAR sequence and the TATA box. Construct #1168 lacks the TAR sequence, the 
TATA box, and the three Spl binding sites. HeLa cells were transkndy transfected 
with the latter const ru cts as described before and AP gene expression was observed as 

30 described before. 

FIG. 3 shows that in the absence of Tat, the full-length HIV-1 promoter in 
construct #1085, attaining the TAR sequence, was unable to induce significant 
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expression of the AP gene, resulting in about 2% AP gene activity. In the presence of 
0.5 ug protein expression piasmid Tat, construct #1085 activated AP gene expression. 
Tbe AP gene activity produced by construct #1085 was set to 100%. Thus, in Che 
presence of Tat, there was approximately, a 50-foW activation. Deletion construct 
5 #1166 that lacked most of the TAR sequence showed only about 2% AP activity in the 
presence of Tat protein. However, in the absence of Tat protein, the truncated HIV-1 
promoter produced about 12% AP activity. Deletion constructs #1213, #1162 and 
#1 163 showed no basal and also no Tat-inducible promoter activity. 

10 Examples 

Constructs to Study the Effect of Spacing on the Function of the Collision Construct 
Constructs having varying distances or spacer regions between the first and 
second promoters or between the second promoter and reporter gene were made to 
study the effect of spacing on the function of the collision construct. 

15 The various collision constructs made contained spacer regions of 21 

nucleotides (construct # 1 190), 94 nucleotides (construct #1 181), 153 nucleotides 
(construct #1 187), 406 nucleotides (construct #1 188), 556 nucleotides (construct 
#1189) and 2047 nucleotides (construct #1 159), positioned between the 3* end of the 
AP coding sequence, as defined by the stop codon, and the end of the TAR sequence at 

20 +59 nucleotide in the HIV-1 promoter, with +1 nucleotide as the start of 

transcription. Those constructs were made as follows: Construct #1190 was made by 
restriction digest of construct #1 152 with Hpal and Hindlll and retigatkm. This 
manipulation changed the stop codon from TAA to TGA. Construct #1181 was made 
by restriction digest of construct #1152 with SaU and Hindlll and rdigition. 

25 Constructs #1187, #1188 and #1189. respectively, were made by "insertion of parts of 
the PEBP2oc coding region, as described in Ogawa a al. (1993) Proc. Nad. Acad. Sri. 
USA 90:6859-6863, as a Sacl/Wndlll DNA fragment, an AsplWHindlU DNA 
fragment or a NcoI/HindlU DNA fragment, respectively, into construct #1 152 cleaved 
with Sail which was bluru-enoed and Hindlll. Construct #1159 was made by insertion 

30 of the luciferase gene isolated from piasmid pT3/T7-Luc (Oontech, Palo Alto, CA) as 
a Sall/Asp7IS fragment into construct #1152 cleaved with the same enzymes. Results 
are shown in FIG. 4. 
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FIG. 4 illustrates that collision, inhibition, or AP expression is dependent on 
the spacing between the HIV- 1 LTR and the reporter gene, or alternatively, on the 
^pacing between the first promoter and the second promote-. In the presence of 0.5 ug 
of Tat expression plasmid, AP activity in constructs with spacing from about 21 
5 nucleotides to about 556 nucleotides is between 40% and 46%, and thereafter, as the 
spacing increased to about 2047 nucleotides, the alkaline phosphatase activity 
increased, indicating a certain space requirement for collision. Transient transactions 
were done using 1 fig of collision construct DNA and 0.5 ug of either active or 
inactive Tat expression plasmid. These results illustrate that collision is dependent on 
10 the spacing between the HIV-1 LTR and the first regulatory sequence that included the 
reporter gene. 

Example 6 

HeLa Cells Stably -Transfected with Tat Protein Expression Plasmid 
15 HeLa cells were transfected with 10 ug of either active Tat expression plasmid 

or with Tat-tnactive plasmid together with 1 fig of plasmid pSVNeo (purchased from 
Gontech, Palo Alto, CA ) for selection by ctectroporation using a BioRad Gene Pulser 
(Purchased from BioRad, Hercules, CA). Electropo ration was conducted in 
accordance with the manufacturer's instructions. For example, the conditions for 
20 electroporadon are 1000 uF and 300 volts at room temperature in a final volume of 

500 ul medium containing 10% fetal calf serum, and 50 fig/ml each of penicillin and 
streptomycin. Stable Tat expression colonies were identified by resistance to 400 
ug/ml gentamicin (purchased from GIBCO BRL) after about 10 to 14 days. Stable 
colonies were picked, amplified, and analyzed for Tat protein expression by 
25 transfection with construct #1085, the plasmid with an HIV-1 promoter linked to the 
AP reporter gene. Positive Tat-expressing cell lines were identified by measuring 
alkaline phosphatase activity, according to directions described by the manufacturer of 
pSVNeo (Oontech, Palo Alto, CA). 
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The following assay is designed to screen for inhibitors of the Tat protein, the 
Tat/TAR interaction or any other HIV- 1 promoter target. The inhibition sought is 
identified by significant inhibition of long terminal repeat (LTR) activity. Cell lines 
are stably transfected as described in Example 6 above with the collision construct 

5 #1152 and the Tat plasmid. Those stabry transfected cell lines (hat produce a steady- 
state alkaline phosphatase activity of about - 30% to 40% compared to control HeLa 
cells stably transfected with only die collision construct are selected for screening. 
Screening for inhibitors is conducted as follows: inhibitors are introduced into the 
culture medium and, after about 16 to 20 hours, the supernatant is analyzed for 

10 alkaline phosphatase activity, as described in Example 6. Using a 96-wdl assay plate, 
for example, 12 different inhibitors, at 8 different concentrations, are tested 
simultaneously. Those inhibitors that produce an increase in alkaline phosphatase 
activity are further characterized by transient transfection experiments. Transient 
transfections are conducted as described in Example 2. For example, HeLa cells are 

15 transiently transfected with construct #1 152 and a controlled amount of Tat plasmid in 
the presence of an inhibitor. The inhibitor is then tested for dose responsiveness to Tat 
by titrating the inhibitor or the Tat expression plasmid. Separate transient transfection 
assays are repeated for each inhibitor selected by the screening process described 
above. 

20 

I 

Example 7 
Constructs Including T AR Decoys 
To prove the functionality of the collision construct in identifying inhibitors of 
25 HIV-1 transcription by an increase in AP reporter gene activity, we obstructed vectors 
containing multirnerized TAR sequences (a schematic of which is indicated in FIG 6). 
Overexpression of TAR-containing sequences, referred to as TAR decoys have proved 
sufficient at inhibiting HIV-1 promoter activity by squelching Tat-mediated 
transacuvatkm. as described in SuUenger ct oL Cdl 63: 601-608 (1990) and Graham 
30 and Mak>, Proc. Hart Acad Sd USA 87: 5817-5821 (1991). The ability of Tar decoys 
to block HIV-1 transcription was previously analyzed with an HIV-AP construct in 
which the reporter gene was placed under the regulatory control of the HIV-1 
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promoter, as schematically represented in FIG 6a. Transection of the HIV-AP 
plasmid together with a Tat expression vector into HeLa cells showed approximately a 
45-fold stimulation of reporter gene expression relative to the level detected in the 
absence of Tat. Addition of the TAR decoys almost completely abolished the 
5 transacti vation function of TaL Next, the ability of the TAR decoys to inhibit HIV-1 
transcription in the context of the collision construct, as represented schematically in 
FIG. 6b. The results showed that the addition of TAR decoys blocked the potential of 
Tat to repress reporter gene expression by counter-transcription. At high TAR decoy 
concentrations, AP activity was even higher than the level detected in the absence of 

10 Tat, as shown in lanes 4 and 5 of FIG. 6b. These data suggest taht the TAR decoys 
sequester not only Tat but also other cellular factors that bind to the TAR sequence. 
This result is further supported by die result schematically depicted in a portion of FIG 
1 that indicated that AP values detected with a ATAR construct are similar to the 
values obtained with the wild type collision construct in the presence of the TAR 

15 decoys. 

The plasmid pBJ-TAR8 was constructed by insertion of multimerized TAR 
seq uen ces isolated from pHIVSCAT with Xbal and Hindlll restriction enzymes 
(nucleotides -40 to +59) downstream of the Sra promoter in pBJ, as described in 
Takebe ex al Mai Cell. Biol. 8: 466472 (1988). As a control plasmid, the leptin 
20 gene, as described in Giese et al Embo J. 12:4667-4676 (1993) with a similar size 
compared to the TAR sequences was inserted into the pBJ vector. 
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WHAT IS CLAIMED IS: 



1. A collision construct comprising a nucleic acid molecule thai comprises: 
a) a first regulatory sequence that comprises a first promoter, 

5 b) a reporter gene that is under transcriptional control of the 

first promoter; and 
c) a second regulatory sequence that comprises a second 

promoter; 

wherein the first promoter Is different from the second promoter, and 
10 directum of transcription under the first promoter is opposite to direction of 
transcription under the second regulatory sequence, and 

wherein regulation of the second promoter alters reporter gene signal. 



2. The collision construct of claim 1, wherein the second promoter 
15 comprises a first response element that is capable of specifically binding to a first 
binding protein to form a first binding pair; and formation of the first binding pair 
regulates the second promoter under transcription-regulating conditions. 



3. The collision construct of claim 1, wherein the regulation of the second 
20 promoter is achieved by activation thereof. 

4. The collision construct of claim 1, wherein the second promoter 
comprises a 5' terminus and a 3* terminus and the reporter gene is separated from 3' 
terminus of the second promoter by a distance of less than about 2047 nucleotides. 

25 

5. The collision construct of claim 4, wherein the distance is in a range 
selected from the group of ranges, in nucleotides, of about 1-50, 51-100, 101450, 
151-200, 201-300, 301-400, 401-500, 501-600, 601-1000. 1001-1500, 1501-2200. 



30 



6. The collision construct of daim 4, wherein the distance is in a range 
selected from the group of ranges, in nucleotides, of about 1-20, 21-40, 41-60, 61-30, 
81-100, 101-120. 121-140, 141-160, 161-250, 25M25, 426-550. 
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7. The collision construct of claim 4, wherein the distance, in nucleotides, 
is selected from the group consisting of about 21, 94, 153, 406, 556 and 2047. 

5 8. The collision construct of claim I, wherein the first promoter is a 

minimal promoter. 

9. The collision construct of claim 1, wherein one or both of the first and 
second promoters are each selected from the group consisting of promoters derived 

10 from a vims, a bacteriophage, a prokaryotic gene, and an eukaryotic gene. 

10. The collision construct of claim 9, wherein the virus is selected from the 
group consisting of a retrovirus, a vaccinia virus, a herpes virus, a hepatitis virus, a 
papilloma virus, an adenovirus, and an adeno- associated virus. 

15 

11. The collision construct of claim 1, wherein the first promoter is selected 
from the group consisting of a Xpl promoter, a Xm promoter, a prokaryotic ribosomal 
RNA P1/P2 promoter, a Rous Sarcoma Virus promoter, a Simian Virus 40 promoter, 
a simian immunodeficiency virus promoter, an albumin promoter, a Ick promoter, and 

20 zfas promoter. 

12. The collision construct of claim 9, wherein second promoter is a 
promoter derived from a virus and the virus is selected from the group consisting of a 
cytomegalovirus, a herpes simplex virus, a hepatitis vims, and a human 

25 irnmunodeficiency vims. 

13. The collision construct of claim 9, wherein the second promoter is 
selected from the group consisting of a /If promoter, a CD4 promoter, and a 0-3 
promoter. 

30 

14. The collision construct of claim 1, wherein one or both of the first 
regulatory sequence and second regulatory sequence comprise a synthetic sequence. 
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15. The collision construct of claim 14, wherein the first regulatory 
sequence comprises a synthetic sequence and the synthetic sequence is selected from 
the group consisting of a muidmeric Gal4 binding site linked to a minimal promoter 
and a LexA binding site linked to a minimal promoter. 

16. The collision construct of claim 14, wherein the synthetic sequence 
comprises a TATA box. 

17. The collision construct of claim 1, wherein the first promoter comprises 
a second response element that is capable of specifically binding to a second binding 
protein to form a second binding pair, wherein formation of the second binding pair 
regulates the first promoter under transcription-regulating conditions, and the second 
binding protein is incapable of specifically binding to the first promoter. 

» 

18. The collision construct of claim 1, wherein the reporter gene is selected 
from the group consisting of genes encoding alkaline phosphatase, luciferase, 
chloramphenical acetyl transferase, P-galactosidase, (^-glucuronidase, and green 
fluorescent protein. 

19. The collision construct of claim 2, wherein the first response element is 
derived from a promoter or promoter/enhancer region of a gene selected from the 
group consisting of a viral gene, a bacteriophage gene, a prokaryotic gene, and an 
eukaryotic gene. 

20. The collision construct of claim 17, wherein the second response 
element is derived from a promoter or pnmioter/enhancer region of a gene selected 
from the group consisting of a viral gene, a bacteriophage gene, a prokaryotic gene 
and an eukaryotic gene. 
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21. The collision construct of claim 2, wherein the first response element is 
one selected from the group consisting of a transactrvation response element ("TAR"), 
Rev response element ("RRE"), a NFhcB binding site, and a Spl binding site. 

5 22. The collision construct of claim 17, wherein the second response 

element is one selected from the group consisting of a transactivation response element 
("TAR"), Rev response element ("RRE"), a NF-tB binding site, and a Spl binding 

site. 

10 23. The collision construct of claim 2, wherein the first binding protein is 

one selected from the group consisting of Tat, Rev, NF-kB, and Spl. 



24. The collision construct of claim I , wherein die first promoter has a 
strength of transcription that is approximately the same as that of the second promoter 
IS upon activation. 



25. A vector comprising the collision construct of claim 1, further 
comprising a nucleotide sequence that allows for expression of the collision construct 
in a host cell. 

20 

26. A host cell comprising the vector of claim 25. 

27. The host cell of claim 26, wherein the host cdl is capable of amplifying 
the collision construct or effecting the expression thereof. 

25 

28. The host cell of claim 26, wherein the cdl is selected from the group 
consisting of a prokaryotic cdl and an eukaryooc cell. 



29. The host cdl of claim 27, wherein the cdl is capable of amplifying die 
30 collision construct and is a prokaryotic cdl. 
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30. The host cell of claim 27, wherein the cd! is capable of effecting the 
expression of the collision construct and is an eukaryodc cell. 

31. A kit comprising the collision construct of claim 1, further comprising 
5 instructions for use thereof. 

32. A kit comprising the vector of cham 25, further comprising instructions 
for use thereof. 

10 33. A kit comprising the host cell of claim 26, further comprising 

instructions for use thereof. 



15 



20 



34. A method for screening a candidate inhibitor for its ability to inhibit 
transcription under a target promoter comprising: 



a) 



b) 



c) 



providing a cell that comprises the collision construct of 
claim 1, wherein the second promoter is the target promoter; 
determining reporter gene signal in the presence and absence 
of the candidate inhibitor; and 
comparing reporter gene signals obtained to determine 
whether inhibition of transcription under the second 
promoter occurred in the presence of the candidate inhibitor. 



25 



35. The method of claim 34, wherein the target promoter is not endogenous 
to the ceil. 



36. The method of claim 34, wherein the target promoter is endogenous to 



the cell. 



37. A method for screening a candidate inhibitor for its ability to inhibit 
30 binding between a target binding protein and a target response dement comprising: 

a) providing a cdl that comprises the collision construct of 

claim 2, wherein the first binding protein is the target 
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binding protein and the first response element is the target 
response dement; 
b) providing a basdine reporter gene signal in the absence or 

presence of die target binding proton; 
5 c) d ete r mining reporter gene signal in the presence and absence 

of the candidate inhibitor; and 
d) comparing reporter gene signals obtained. 

38. The method of claim 37, wherein the target response dement is not 
10 endogenous to the ceil. 

39. The method of claim 37, wherein the target response dement is 
endogenous to the cell. 

15 40. The method of claim 37, wherein the target binding protein is provided 

by a process selected from the group consisting of: 

a) introducing into the cell a nucleotide sequence that encodes 
the target binding protein; 

b) allowing a cdl that is capable of producing the target 

20 binding protein consritutivdy to produce the target binding 

protein; and 

c) adding the target binding protein to the cdl. 

41. A method for identification of an inhibitor of transcription under a target 
25 promoter comprising: 

a) providing a ceil that comprises the collision construct of 
claim 1, wherein the second promoter is the target promoter, 

b) determining reporter gene signal in die presence and absence 
of a panel of candidate inhibitors; 

30 - c) comparing reporter gene signals obtained to determine if 

inhibition of target promoter activity has occurred in the use 
of any one of the pand of candidate inhibitors. 
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42, A. method for making a reporter collision construct, comprising: 

a) providing a first regulatory s eq u ence that comprises a first 
promoter, a reporter gene that is capable of providing a 

5 detectable signal upon transcription and translation, and a 

second regulatory sequence that comprises a second 
promoter; and 

b) linking the first regulatory sequence, the reporter gene, and 
the second regulatory sequence together to produce the 

10 collision construct of claim 1 . 



43. A method for production of a collision construct, comprising culturing 
the host cell of claim 33. 

15 44. A collision construct produced by a process comprising expressing the 

collision construct of claim 1 in a prokaryotic or eukaryouc cell. 

45. The collision construct of claim 44, wherein the eukaryouc cell is 
selected from the group consisting of a mammalian cell, an insect cell, a yeast cell, and 
20 an avian cell. 
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ARGUMENT MAP IN DMA STRAND 4-1161 

fron the '/arp/||b/6mers' file. 

Mil 1 1 1 1 « 1 1 1 M II I j | Ml 

KASl SAL I HJTO3 SPHI SWT SCX1 KPN1 

WAR I XBAI AFL2 PVU2 BSFE1 SACI 

NAEI BANHI SACI XBAI ECORI 

SKA I BSAB1 B6L2 XKAI 
WW I HIKD3 
HPAI SPHI 

SSE83871 
PSTI 
SALI 
XBAI 

^Bl AP-STOP 
KAB1 COLON 



1 CTGCGACTGGCGaCCCaC^CCACC6AC6CCa 

GACGCTGACCGCGGGGGCG6CC6TG6TG6CT6C6GCGC6TGGGCCCAATT6GGCACCA6G 

~ A A 

9 KASl NARI. 18 NAEI, 41 SNAI XKAI. 46 HPAI. 

61 CCGCGTTGCnCCTCJGCT^ 

GGCGCAACGAAGGAGACGACCGGCCCTGTAGTCCACCGGGG6CGACTTAACCTTAGCAGC 

A 

117 SALI # 

121 ACTCTAGA6GATCCCCATCM6CTTGCAT6CCT6CAG6TC6ACTCTAGAGGATCCCCATC 
TGAGATCTCCTA6GGGTAGTTCGAACGTACGGACGTCCA6CT6AGATCTCCTAGG6GTAG 

A A A A A A A ^ A A A 

123 XBAI. 129 BANHI. 130 BSAB1. 140 HIN03. 146 SPHI. 151 SSE 
83871, 152 PSTI. 158 SALI, 164 XBAI, 170 BANHI. 171 BSAB1, 

181 AAGCTTTATT6AG6CTTAAGCAGTGGGTTCCCTA6TTAGCCA6AGAGCTCCCA66CTCAG 
TTC6AAATAACTCCGAATTCGTCACCCAAGGGATCAATC66TCTCTC6AGGGTCCGA6TC 

* A A 

181 HIND 3-195 AFL2 . 225 SACI. 239 BGLD. 

24 1 ATCTGGTCTAACCAGA6A6ACCCAGTGCAT6C AAAAA GCAGCT6CTTATAT6CAGCATCT 
TAGACCAGATTSGTCTCTCT6GGTCACSTACGTTTnCGTCGAC6AATATACGTCGTA6A 

. IT LC 

267 SPHI. 279 PVU2. 298 XBAI, 

301 AGAGG6CAC6CCAaCCCCAGTCCC6CCa6G(XAC6CCTCCC6GGAAAGTCCCCAGCGG 
TCTCCCGTGC6GTGAGGGGTCAGGGCGGGTCC6GTGC6GA6G6CCCTTTCAGGGGTC6CC 

Insertion * 4A 

341 SNAI XKAI, 

361 AAAGTCCCnG6A6AAAGCTC6AT6TCAGa6TCTTT6TAGTACTCCGGATGCAGCTaC 
mCA6GGAAC(TCmC6AGaAaGTC6TCAGAAACATCAT6A6GCCTAC6TCCA6AG 

A A 

400 SCAI. 405 BSPE1. 

421 66GCMT6TGAT6AMT6CTA6TTT^ 

Ct(^GTACACTACTTTAC6ATCAAACGACAGTTTG6AGGT6TGATTGT6AAGAAA6AG6C 

481 CGTCCTCCATCCCAT6CAG6CTCATA6G6T6TAACAA6CT6TT6TTCTCTCCnCATTGG 
GCA66A6GTAGGGTACGTCCGAGTATCCCACATTGTTCGACAACAAGA6AGGAAGTAACC 

541 CCTCnCTACCnaCTGGCTCAACTGGTAqAGCrTGAAGCACaTCCAAAGGTCAGTG 
GGAGAA6ATG6AAGAGACCGAGn6ACCAT6ATCGAACTICGTG6TAG6TTTCCAGTCAC 

601 6AT6G6TACCGA6CTC6AATTCCCTATA6T6A6TC6TATTAAATTCGTAATCA 
CTACCCATGGCTCGA6CTTAAGG6ATATCACTCAGCATAATTTAAGCATTAGT 



605 KPNI. 611 SACI, 617 ECORI. 
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